The Segmentation of Speech
نویسنده
چکیده
This paper reports a phenomenon supporting the hypothesis that the emergence of structure in the evolution of language was a staged process. To develop a grammatical structure it seems necessary to first have discrete constituents which can be the building blocks of a hierarchical system. By analysing observed speech we show that the development of a linear sequence of grammatical constituents has its own advantage, before a possible next stage when constituents are integrated into a hierarchical structure. A stream of speech sounds has to be segmented to allow for breathing. This segmentation has further developed in a certain way that makes it easier for the hearer to decode than if it were not segmented, or if it were segmented in an arbitrary manner. Well known tools from Information Theory are employed to analyse the ease of decoding speech. Segmentation depends on prosodic discontinuities, such as pauses and intonation marked by tone unit boundaries. These discontinuities usually mark groups of words with some syntactic cohesion, such as phrases and clauses. We show that in a modern corpus of spoken language observed segmentation facilitates the effective transfer of information, while lack of segmentation or arbitrary segmentation imposed on a stream of words makes decoding less efficient. This supports the hypothesis that the necessary constituents of a grammatical structure may have evolved as a consequence of developments favouring more efficient decoding of a linear stream of spoken words. The source material for this investigation is taken from the prosodically marked up Machine Readable Spoken English Corpus (MARSEC).
منابع مشابه
Word segmentation in Persian continuous speech using F0 contour
Word segmentation in continuous speech is a complex cognitive process. Previous research on spoken word segmentation has revealed that in fixed-stress languages, listeners use acoustic cues to stress to de-segment speech into words. It has been further assumed that stress in non-final or non-initial position hinders the demarcative function of this prosodic factor. In Persian, stress is retract...
متن کاملمقایسه روشهای مختلف یادگیری ماشین در خلاصهسازی استخراجی گفتار به گفتار فارسی بدون استفاده از رونوشت
In this paper, extractive speech summarization using different machine learning algorithms was investigated. The task of Speech summarization deals with extracting important and salient segments from speech in order to access, search, extract and browse speech files easier and in a less costly manner. In this paper, a new method for speech summarization without using automatic speech recognitio...
متن کاملAn improved joint model: POS tagging and dependency parsing
Dependency parsing is a way of syntactic parsing and a natural language that automatically analyzes the dependency structure of sentences, and the input for each sentence creates a dependency graph. Part-Of-Speech (POS) tagging is a prerequisite for dependency parsing. Generally, dependency parsers do the POS tagging task along with dependency parsing in a pipeline mode. Unfortunately, in pipel...
متن کاملQuantitative Comparison of SPM, FSL, and Brainsuite for Brain MR Image Segmentation
Background: Accurate brain tissue segmentation from magnetic resonance (MR) images is an important step in analysis of cerebral images. There are software packages which are used for brain segmentation. These packages usually contain a set of skull stripping, intensity non-uniformity (bias) correction and segmentation routines. Thus, assessment of the quality of the segmented gray matter (GM), ...
متن کامل“ A Review : Different methods of segmenting a continuous speech signal into basic units ”
Speech is the medium through which human beings can communicate. Segmentation of speech is required for better speech recognition. Segmentation of speech can be done into basic units like words, phonemes or syllables. The two main methods used for segmentation of speech signals are manual segmentation and automatic segmentation. But manual segmentation is not favoured as it is tedious, time con...
متن کاملThe effects of segmentation and redundancy methods on cognitive load and vocabulary learning and comprehension of English lessons in a multimedia learning environment
The present study was conducted with the aim of the effects of segmentation and redundancy methods on cognitive load and vocabulary learning and comprehension of English lessons in a multimedia learning environment.The purpose of this study is an applied research and a real experimental study. The statistical population of the present study includes all people aged 14 to 16 who are enrolled in ...
متن کامل